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Abstract — We consider a single-hop data gathering sen- 
sor cluster consisting of a set of sensor nodes that need 
to transmit data periodically to a base-station. We are 
interested in maximizing the lifetime of this network. Even 
though the setting of our problem is very simple, it turns 
out that the solution is far from easy. The complexity 
arises from several competing system-level opportunities 
that can be harnessed to reduce the energy consumed 
in radio transmission. First, sensor data in a cluster is 
spatially and temporally correlated. Recent advances in 
distributed source coding allow us to take advantage of 
these correlations to reduce the number of bits that need 
to be transmitted, with concomitant savings in energy. 
Second, it is also well known that channel coding can 
be used to reduce transmission energy by increasing 
transmission time. Finally, sensor nodes are cooperative, 
unlike nodes in an ad hoc network that are often modeled 
as competitive. This collaborative nature allows us to take 
full advantage of the first two opportunities for the purpose 
of maximizing cluster lifetime. 

In this paper, we pose the problem of maximizing 
lifetime as a max-min optimization problem subject to the 
constraint of successful data collection and limited energy 
supply at each node. This turns out to be an extremely 
difficult optimization to solve. Consequently, we employ a 
notion of instantaneous decoding to shrink the problem 
space. We show that the computational complexity of 
our model is determined by the relationship between 
energy consumption and transmission rate as well as model 
assumptions about path loss and initial energy reserves. 
We provide some algorithms, heuristics and insights for 
several scenarios. In some situations, our problem admits 
a greedy solution while in others, the problem is shown 
to be AfV-hard. The chief contribution of the paper is 
to illustrate both the challenges and gains provided by 
source-channel coding and scheduling. 

I. Introduction 

Minimizing energy consumption is one of the primary 
technical challenges in sensor networking. Many sensor 
applications such as habitat monitoring and industrial 
instrumentation envisage scenarios in which a large 



number of sensor nodes, powered by tiny batteries, will 
be actively deployed for months and even years. In many 
instances, it may not be possible to replace these sensor 
nodes once they run out of energy because the sensor 
nodes could be inaccessible (for example, embedded in 
concrete structures to sense stress levels). Replacing dead 
batteries in a sensor network consisting of a large number 
of nodes may also not be economically feasible. There 
is a now a broad consensus that aggressive system level 
strategies impacting many layers of the protocol stack 
need to be devised to meet the lifetime requirement of 
extant and future sensor networks. 

In this paper, we consider a single-hop sensor clus- 
ter. Nodes in the cluster periodically sample a field 
and transmit the data directly to a central location 
or base-station. We are interested in minimizing the 
energy spent by these nodes in transmitting, with the 
objective of maximizing cluster lifetime. Sensor nodes 
also spend energy in receiving data, sensing/actuating 
and computation. The energy spent in sensing/actuating 
represents a fixed cost that can be ignored. The energy 
cost of receiving data can be easily incorporated in our 
optimization model. We assume computation costs are 
negligible compared to radio communication costs. This 
is debatable assumption in dense networks; we intend to 
incorporate computation costs in future work. 

We believe that our model is useful because many 
popular proposals recommend organizing a sensor net- 
work into clusters [1]. Here each cluster elects a cluster- 
head (which we call base-station). Nodes communicate 
only through their respective cluster-heads. Approaches 
that maximize cluster lifetime can be thought of as 
being complementary to network-wide approaches such 
as energy-efficient routing. Moreover, our model is ap- 
plicable to scenarios where a roving base-station moves 
from one cluster to another, gathering data. 

We define cluster lifetime as the time until the first 
node in the cluster runs out of energy. While this is a 



somewhat pessimistic definition, we argue that a cluster 
will consist of relatively few nodes. The failure of 
even one such node can have disastrous consequences 
on the cluster's performance (for example, coverage). 
This definition also has the benefit of being simple and 
popular [2], [3]. Other definitions proposed for network 
lifetime such as mean expiration time and time until a 
certain fraction of nodes fail are not appealing from a 
cluster viewpoint. 

Somewhat to our surprise, we find that analyzing the 
performance of this simple model is far from trivial. 
The complexity arises from several competing system- 
level opportunities to be harnessed to reduce the energy 
consumed in radio transmission. First, sensor data in 
a cluster is spatially and temporally correlated. In [4], 
Slepian and Wolf show that it is possible to com- 
press a set of correlated sources down to their joint 
entropy, without explicit communication between the 
sources. This surprising existential result shows that 
it is enough for the sources to know the probability 
distribution of data generated^. Recent advances [5] in 
distributed source coding allow us to take advantage of 
data correlation to reduce the number of bits that need 
to be transmitted, with concomitant savings in energy. 
Second, it is also well known that channel coding [6] 
can be used to reduce transmission energy by increasing 
transmission time. Finally, sensor nodes are cooperative, 
unlike nodes in an ad hoc network that are often modeled 
as competitive. This collaborative nature allow us to 
take full advantage of the first two opportunities for the 
purpose of maximizing cluster lifetime. 

Motivated by our definition of cluster lifetime, we 
pose the problem of maximizing lifetime as a max- 
min optimization problem subject to the constraint of 
successful data collection and limited energy supply at 
each node. This turns out to be an extremely difficult 
optimization to solve. Consequently, we employ a notion 
of instantaneous decoding to shrink the problem space. 
We show that the computational complexity of our 
model is determined by the relationship between energy 
consumption and transmission rate as well as model 
assumptions about path loss and initial energy reserves. 
We provide some algorithms, heuristics, and insights for 
several scenarios. In some situations, our problem admits 
a greedy solution while in others, the problem is shown 
to be A^P-hard. The chief contribution of the paper is 
to illustrate both the challenges and gains provided by 
source-channel coding and scheduling. 

'Actually knowledge of conditional entropies suffices. 



There is much related work in this area. Energy 
conscious networking strategies have been proposed by 
many researchers mainly at the MAC [7] and routing 
layer [2], [8]— [12]. Our study was motivated by previ- 
ous research in [13], [14], which explicitly incorporate 
aggregation costs in gathering sensor data. In [14], the 
authors consider the problem of correlated data gathering 
by a network with a sink node and a tree communication 
structure. Their goal is to minimize the total transmission 
(energy) cost of transporting information. The first part 
of [13] considers a model similar to ours, namely, that 
of several correlated nodes transmitting directly to a 
base station. However, both [13] and [14] are interested 
in minimizing total energy expenditure, as opposed to 
maximizing network lifetime. In the latter case, the 
optimal solution is shown in both papers to be a greedy 
solution based on ordering sensors according to their 
distance (which reflects data aggregation cost) from the 
base station. However, we show that this solution is not 
optimal for maximizing network lifetime. An early ver- 
sion of our ideas appeared in [15]. This paper generalizes 
the work presented in [15] and provides proofs of some 
key conjectures there. 

In section [TTJ we present our system model and de- 
scribe our notion of instantaneous decoding. In section 
UTTl we consider a general channel model which allows us 
to consider the joint impact of cooperative nature of the 
sensor nodes and source and channel coding on system 
lifetime. We prove that the both, the static and dynamic 
scheduling problems for the general channel model, are 
MV-hard and the optimal dynamic scheduling strategy 
does better than optimal static scheduling strategy, in 
general. We also provide the geometric interpretation of 
the optimal solutions and the solution search procedures. 
As a special case of this problem, in section ITVl we 
consider a scenario which allows us to neglect the impact 
of transmission time allocation. This is similar to the 
scenario considered in [13] and [14]. Here we provide 
some key insights into the nature of the optimal solutions 
for both, static and dynamic scheduling, derived in [15]. 

II. System Model 

We consider a network of N battery operated sensor 
nodes strewn uniformly in a coverage area. Time is di- 
vided into slots or rounds. In each slot, sensors take sam- 
ples of the coverage area and transmit the information 
directly to the base station. We model the sensor readings 
at node A; by a discrete random process representing 
the sampled reading value at node k in the n th time 
slot. We assume that sensor readings in any time slot are 



spatially correlated. We ignore temporal correlation by 
assuming that sensor readings in different time slots are 
independent. However, temporal correlation can easily be 
incorporated in our work for data sources satisfying the 
Asymptotic Equipartition Property (AEP). The entropy 
of X% is denoted by hk- 

Initially, sensor node k, 1 < k < N, has access to 
units of energy. The wireless channel between sensor k 
and the base station is described by a path loss factor dk, 
which captures various channel effects such as distance 
induced attenuation, shadowing, and multipath fading. 
For simplicity, we assume d^s to be constant. This is 
reasonable for static networks and also in the scenarios 
where the path loss parameter varies slowly and can 
be accurately tracked. The network operates in a time- 
division multiple access (TDMA) mode. In each slot, 
every sensor is allotted a certain amount of time during 
which it communicates its data to the base station. 

The general problem is to find the optimal rate (the 
number of bits to transmit) and transmission times for 
each node, which maximize network lifetime. Both the 
rate and time allocation are constrained. The rate alloca- 
tion should fall within the Slepian-Wolf achievable rate 
region and the sum of transmission times should be less 
than the period of a time-slot (which is taken to be unity). 
Finding the optimal rate allocation is a computationally 
challenging problem as the Slepian-Wolf achievable rate 
region for N nodes is defined by 2^ — 1 constraints. 
We simplify the problem by insisting that decoding at 
the base-station be instantaneous in the sense that once 
a particular node has been polled, the data generated 
at that node is recovered at the base-station before the 
next node is polled. This reduces the rate allocation 
problem to finding the optimal scheduling order, albeit 
at some loss of optimality. This loss of optimality occurs 
because our problem formulation assumes the separation 
between source and channel coding and it is well-known, 
[16], that the source-channel separation does not hold 
for the multi-access source-channel coding problem and 
Slepian-Wolf coding followed by channel coding is not 
optimal for the joint source-channel coding problem. 
Also, in general, turning a multiple-access channel into 
an array of orthogonal channels by using a suitable MAC 
protocol (TDMA in our case) is well-known to be a 
suboptimal strategy, in the sense that the set of rates 
that are achievable with orthogonal access is strictly 
contained in the Ahlswede-Liao capacity region [17]. 
However, despite this fundamental sub-optimality, we 
argue like [18], [19] that there are strong economic 
gains in the deployment of networks based on such 



technologies, due to the low complexity and cost of 
existing solutions, as well as available experience in the 
fabrication and operation of such systems. 

Let n be the set of permutations of the set, 
{1, 2, . . . , N}. The polling schedule followed by the 
network in any time slot corresponds to a permutation, 
7r G II, |n| = N\. Let n(k) denote the k th node to 
be scheduled. Instantaneous decoding implies that the 
amount of data to be transmitted by node ir(k) is the 
conditional entropy of the data source at node 7r(/c), 
given the data generated by all previously polled nodes. 
We denote the amount of information generated by node 
7r(k) by h n ^y Our aim is to find the scheduling strategy 
(scheduling order and transmission time allocation) that 
maximizes network lifetime. 

III. General Channel Scenario 

In this section, we consider the general channel coding 
scenario where the transmission energy is the convex 
decreasing function of the transmission time. For ex- 
ample, by inverting Shannon's channel capacity formula 
for the AWGN channel, it is straight-forward to show 
that transmission energy is a strictly decreasing convex 
function of transmission time [6]. Other channel coding 
situations lead to a similar result. In such a scenario, 
we not only have to find the optimal scheduling order, 
but also the optimum transmission times for each node. 
We consider two kinds of schedules, namely, static and 
dynamic. In static scheduling, the nodes follow the same 
fixed scheduling order in all time slots until the network 
dies. Under dynamic scheduling, we allow nodes to 
collaborate further by allowing them to employ different 
schedules in different time slots. More specifically, it 
is offline dynamic scheduling, where before the actual 
operation of network starts, the base-station has already 
computed the optimum set of schedules and the number 
of slots for which each schedule is used, rather than 
online dynamic scheduling, where only at the beginning 
of every polling slot, the base-station computes on fly 
the optimum schedule for that time-slot, based on its 
knowledge of the latest state of the network. 

Let f(h, x)d be the energy required to transmit h bits 
of information in x units of time with path loss factor 
d. So, we can interpret f(h,x) as the energy required 
to transmit h bits of information in x units of time with 
unit path loss. Based on our discussion, we model the 
energy function f(h,x) as follows. 

1) f(h, x) is a strictly decreasing continuous positive 
convex function in x. 

2) lim x ^ f(h, x) = oo 



Unless stated otherwise, we assume f(h, x) to be 
the one that is obtained by inverting Shannon's AWGN 
capacity formula [6], that is: 



f{h,x) = x(2- -1) 



(1) 



A. Static Scheduling 



In static scheduling, each permutation, tt € II 
corresponds to a TDMA schedule. Let h^OA = 
H(X 7T ( k - j \X n ( k _ 1 -j, . . . ,X n (i\) denote the number of in- 
formation bits transmitted per slot by node k under 
the schedule 7r G IT. Let t n r k -\ be the corresponding 
transmission time alloted to node k and L n be the 
lifetime achievable by the system under the schedule tt. 
Note that lifetime is integer-valued but we will treat it as 
a real number. The optimal static schedule is the solution 
to the following optimization problem. 



max L n ( 
Tren 
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i<k<N f(K {k) ,t^ k) )d k 



(2) 



*7r(Jfe) ^ 



N 

Yl ^( fc ) = 1 

k=l 

However, using the "Channel Aware" algorithm pro- 
posed in [15] for every schedule it, we have the maxi- 
mum lifetime and the corresponding transmission times 
allocation vector {t n ( k )} k=v Further, for this transmis- 
sion time allocation vector, all the sensor nodes achieve 
the same lifetime. So, the problem in (fj) reduces to the 
following optimization problem: 



maxL n ( 
vrgn 



Ek 



f(K(k),t n ( k ))d k ' 



(3) 



E 

k=l 



t ir(k) 



ir(k) 
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Before we analyze the solution of the optimal schedul- 
ing problem for the general case, it should be noted 
that given the cooperative nature of the sensor nodes, 
the nodes can collaborate with each other to a much 
greater extent by varying their transmission times. For 
example, nearby nodes can finish their transmissions 
sooner, allowing far away nodes more time to transmit 
in order to improve system lifetime. The structure of 
the problem in (f2]) or (0) is such that its computational 
complexity depends on the sensor node data correlation 
structure as well as the energy function. The following 
three examples amply illustrate this. 




Distance between nodes i and j : d;j 

Fig. 1. Data Correlation Model for n — 5: plot of B(Xi/Xj) versus 
cl i V 



Example 1: Let us consider the following model for 
spatial correlation of the sensor data. 

Let Xi be the random variable representing the sam- 
pled sensor reading at node i S {1, . . . , N} and B(Xi) 
denote the number of bits that the node i has to send 
to the base-station. Let us assume that each node i has 
at most n number of bits to send to the base-station, 
so B(Xi) = n. However, due to the spatial correlation 
among sensor readings, each sensor may send less than 
n number of bits. Let us define a data-correlation model 
as follows. 

Let dij denote the distance between nodes i and j. Let 
us define B(Xi/Xj), the number of bits that the node i 
has to send when the node j has already sent its bits to 
the base-station, as follows: 



B(Xi/Xj) 



\dij 
n if d. 



if dij < n 

> n 



(4) 



Figure [TJ illustrates this for n = 5. It should be noted 
that when \dif\ < n, the data of the nodes i and j 
differ in at most B(Xi/Xj) least significant bits. So, the 
node i has to send, at most, \ckj] least significant bits 
of its n bit data. Also note the following property of the 
correlation model: 



B(Xi/X u . . . ,X f _i) < BlXi/Xi-x) 



(5) 



However, the definition of the correlation model is 
not complete yet and we must give the expression for 
the number of bits transmitted by a node conditioned on 
more than one node already having transmitted their bits 
to the base-station. There are several ways in which this 
quantity can be defined. Presently, let us consider the 



following definition of the conditional information: 

B{Xi/X x , = min B(Xi/Xj) (6) 

i<j<% 

Part 1: Here let us assume that the ratio of energy of a 
node and path-loss between base-station and the node is 
equal for all the nodes. So, without the loss of generality, 
for every sensor k, we can put E^/d^ = 1, 1 < k < N. 

This assumption is only to simplify the solution, yet it 
is not such an unrealistic assumption when we consider 
the scenarios such as one where given the equal energies 
of all the nodes, the distance of the base-station from 
any node is much more than the distance between any 
two nodes. Also, when we have the roving base-station 
for the data gathering, this assumption holds good. So, 
using the energy consumption model of dT), this makes 
the time to transmit depend only on the number of 
bits that a node has to send to the base-station, so a 
sensor polling schedule that results in larger value of 
B(X\, X%, . . . , Xn), will also result in the larger value 
of the sum of transmission times of all the nodes. 

Theorem 1: A greedy scheme that assigns informa- 
tion bits to the nodes according to (©, gives the solution 
for the static scheduling problem in © in V. 

Proof: Start with any node as the first node of the 
schedule, then choose the next node in the schedule to 
be that node that minimizes the conditional number of 
bits. However, given the definition of correlation model 
in (01) and ©, this amounts to finding the nearest node. 
So, the schedule that selects the nearest neighbor as the 
next node to be polled is the optimum schedule. We 
call this algorithm: Nearest Neighbor Next or NNN. For 
a desired value of network lifetime, the NNN schedule 
will give the smallest value of B(Xi,X2, . . . , Xn) and 
the smallest sum of the transmission times, so using the 
"Channel Aware" algorithm proposed in [15], we can 
prove that this schedule maximizes the network lifetime. 

It should be noted that it is the special property 
of this problem, due to the correlation model and 
assumption above, that the schedule that minimizes 
B(Xi, X2, ■ ■ ■ , -Xjy), also minimizes the sum of the 
transmission times, and subsequently maximizes the net- 
work lifetime. In general, it is not true that the schedule 
that minimizes B(X\,X2, ■ ■ ■ , -Xjv) also minimizes the 
sum of the transmission times. ■ 

Part 2: Without the assumption in Part 1, here we 
consider the general problem. The following theorem 
proves that for the given spatial correlation model, the 
problem is A^P-hard. 

Theorem 2: The static scheduling problem in ® is 
NV-haxd for the spatial correlation model of ((6]). 



Proof: An arbitrary instance of "Shortest Hamilto- 
nian Path" problem can be reduced to this problem by 
following the same sequence of steps as in the proof in 
Example 2. ■ 
Example 2: Let us consider the spatial correlation 
model of [14], that is the one where the sensor data is 
modeled by Gaussian random field. Thus, we assume 
that an N dimensional jointly Gaussian multivariate 
distribution /(X) models the spatial data X of A sensor 
nodes. 

where K is the (positive definite) covariance matrix 
of X, and \i the mean vector. The diagonal entries 
of K are the variances Ka = erf. The rest of K{j 
depend on the distance between the nodes i and j: 
= a 2 exp(-adfj). 

Without any loss of generality, here we use differential 
entropy instead of entropy, as we assume that the data at 
each sensor node is quantized with the same quantization 
step, and under such assumption, differential entropy 
differs from entropy by a constant. 

Theorem 3: The static scheduling problem in ([3]) is 
A/'P-hard for the spatial correlation model of (|7]) and 
energy function f(h,x). 

Proof: Let us consider the decision version of 
this problem: does there exist a schedule tt, for which 
the network achieves the lifetime L, with the following 
constraints? 

f(K(k),Uk))d k < ^,Vk = l...N (8) 

*7T(fc) ^ 

N 
k=l 

We will prove the ATP-hardness of this problem as 
follows. We reduce an arbitrary instance of "Shortest 
Hamiltonian Path Problem" [20] over Euclidean and 
complete graph, which is well known to be MV- 
complete, to some instance of the problem in ([8]). For the 
given instance of SHP problem, interpret the edge cost 
between nodes i and j, as the spatial distance between 
the nodes i and j of our problem. So, as we visit a node 
k in the SHP tour tt, we can compute the conditional 
entropy h^nA of that node using the knowledge of the 
model for spatial correlation among the sensor nodes as 
well as the history of the tour so far. With this computed 
conditional entropy value, using the first constraint of 
dU), we can compute the minimum time t n ^ that this 



node needs to transmit h n ^ bits of information to the in [14]): 
base-station. So, for every schedule it, we can compute 
the sum of the minimum transmission times. 

Let us consider an Euclidean, complete graph of N = 
4 nodes [A, B,C, D] with symmetric edge costs. Let us 
consider two schedules ABCD and ABDC. We have 
taken N = 4 to only illustrate the main idea of the 
reduction, otherwise the approach is general enough to 
be applicable to the bigger networks. Let us assumes 
that the length of Hamiltonian path cIabcd for schedule 
ABCD is less than (Iabdc f° r the schedule ABDC. 
Now we are going to prove that the sum of transmission 
times for the schedule ABCD is less than that for 
the schedule ABDC. For the spatial correlation model 
of interest (fT]), for every schedule, we can compute 
the conditional entropies of every node based on all 
the nodes visited previously [17]. For example, if the 
schedule is [A, B, C] and Xa, Xb,Xc denote their data 
samples, respectively, then: 



h(X A ) 
h(X B /X A ) 

h(X c /X A ,X B ) 



1 



log(27re), assuming a 2 = 1 



h(X A ,X B )-h(X A ) 
ilog((2vre) det(K AB )) 
h(X A ,X B ,Xc)-h(X A ,X B ) 
1, „„ , det(K ABC ) , 

2 log((27re) ^(^r ) 



where Kabc an d Kab, denote the co variance matri- 
ces of Xa, X B , Xc and Xa,X b , respectively. 

Assume that the transmission time ij of node i £ 
[A, B,C, D] is exponentially dependent on the entropy 
hi of the node (this follows from the empirical results 
obtained after numerically solving the equation £[])). Let 
us denote the transmission times of the nodes A, B, 
C, and D under schedule ABCD as tA,t B ,tc, and to 
respectively. Similarly, for the schedule ABDC, let the 
corresponding times be t' A ,t' B ,t' c , and t' D respectively. 
Note that tA = t' A and t B = t' B . Now 

t A + t B + t c + t D < t' A + t' B + t' c + t' D (9) 
if t c + t D < t' c + t' D (10) 



After substituting the values of tc,tn,t' c , and t' D and 
a little algebraic manipulation of the resulting expres- 
sions, we show that (O is true if (with a and a as defined 



-2ad 2 AC _|_ e -2ad 2 BC > e ~2ad 2 AD + & 



2 " d BD HI) 

if d 2 AC + d 2 BC < Ad + 4d (12) 

if dAB + d B c + dcD < dAB + d BD + dcui 13) 
if d A BCD < d A BDC (14) 



So, if a schedule has smaller Hamiltonian path length, 
then the corresponding sum of the transmission times 
will be smaller too. This implies that the solution of SHP 
gives the smallest value of the sum of the transmission 
times. So, for the schedule that gives shortest Hamilto- 
nian path, we can compute the sum of the transmission 
times and if this sum is less than 1, then we have at least 
one schedule that achieves the lifetime L. ■ 

B. Dynamic Scheduling 

In this section, we explore how network lifetime can 
be increased by employing multiple schedules. Instead 
of restricting the network to follow a single schedule, 
we allow the system to employ different schedules over 
time. There are N\ possible schedules to choose from. 
Let K^qa be the number of information bits generated per 
slot by node k under the schedule ir, 1 < k < N. Two 
or more schedules can collaborate by having the nodes 
use non-optimal transmit energies over two or more data- 
transmission slots to increase the lifetime of the network. 

We have a total of AH schedules. If only m, 1 < 
m < N\ schedules are going to cooperate, then there 
are C(N\,m) possible combination of the schedules. Let 
T- Ki denote the number of time slots for which schedule 
TVi,i G {l,...,m} is employed. The optimal network 
lifetime L under dynamic scheduling is the solution to 
the following optimization problem 



L = max max ^ r OT 

Km<N\ ki."..*m] i=l 



(15) 



s- t. Yl fiK^^n^dkT^ < E k ,Vl <k<N 
i=l 

N 

E ^(fc) = 1,V1 <i<m 



k=l 



Specifically for m = Nl, we have 

N\ 

L = max E T TTi 
i=i 



(16) 



Nl 



s- t. E f(K t (k),tn,(k))dkT^ < E k ,Wl < k < N 



i=i 



N 

Y. t Mk ) = i,vi < i < N\ 

k=l 



Also note that if m = 1, then the problem in (fT5l) 
reduces to the static scheduling problem in ([3]). So, 
the computational complexity of this problem cannot 
be any less than that of the static scheduling problem, 
which is proven to be ATP -hard in Theorem [3] Here the 
question we are concerned with is that if the dynamic 
scheduling can indeed increase the network lifetime. In 
the following, we prove that even for the simplest case 
of the network of two nodes, it is indeed so. 

Theorem 4: For N = 2, dynamic scheduling performs 
better than the optimal static scheduling. 

Proof: For the network of two nodes, let us 
consider two schedules iri where node 1 is polled before 
node 2 and tt 2 , where the nodes are polled otherwise. 
Now using our "Channel Aware" algorithm of the pre- 
vious section, for a given polling schedule we can find 
the optimal allocation of the transmission times such that 
both the nodes spend same amount of energy, dying at 
the same time. Let for schedule -k\, this happens when 
the node 1 transmits for t units of time and node 2 
transmits for 1 — t units of time. Similarly, for schedule 
n 2 , let the corresponding times be t' and 1 — t'. Let h 
denote the entropy of first node polled in the schedule 
and h^ 2 denote the entropy of second node polled. So, 
for schedule tti: h\ = h,h 2 = h 2 \i = hi\ 2 and for 
schedule 7T 2 : hi = hi\ 2 ,h 2 = h. Given the optimality 
of t and il for the schedules tt\ and ir 2 respectively, we 
have for the energy consumptions of the nodes: 

For schedule 7Ti : f{h,t)d\ = /(/ii|2, 1 — t)d 2 (17) 
For schedule vr 2 : f{h x \ 2 , i!)d\ = f(h, 1 - t')d 2 (18) 

If we assume the schedule ir\ to be the optimum static 
schedule, then the following holds true: 

f(h, t)di = f(h 1]2 , l-t)d 2 < fih^dt = f(h, l-t')d. 

Let us consider the plot where the horizontal axis 
corresponds to the energy consumption E\ of the node 
1 and the vertical axis corresponds to the energy con- 
sumption E 2 of the node 2. In this plot, we draw the 
energy consumption curves for both the schedules tti 
and ir 2 , for different values of t,0 < t < 1 and 
t',0 < t' < 1 respectively. Given the form of the 
energy consumption curves, it is easy to verify that these 
two curves corresponding to two different schedules, are 
convex and will intersect at one and only one point. 

Let us consider the "equal energy line" which passes 
through the pair of points (f(h,t)di,f(hi\ 2 ,l — t)d 2 ) 
and (f(hi\ 2 ,t')di,f(h,l — t')d 2 ), so the equation of this 
lines is 

Ei = E 2 (19) 



Now let us also consider a line that passes through 
the point (f(h, r)di, f{hi\ 2 , 1 — r)d%) on the curve 
corresponding to the schedule 7Ti with < r < 1, and 
the point (f{h 1 \ 2 ,s)d\,f(h,\ — s)d 2 ) with < s < 1 
on the curve for schedule tt 2 . The equation for such a 
line is 

E2 = fOh\ 2 , 1 - r)d 2 + m(r, - f(h, r)di (20) 

, , f(h, 1 - s)d 2 - f(hm, 1 - r)d 2 

where m(r, s) = — — 

f{h\2,s)di - f{h,r)di 

Now let us consider the point of intersection of these 
two lines. At the point of intersection, we have: 

Eh = E 2 (21) 

f( h i\2, 1 ~ r )d2 ~ m(r, s)f(h, r)d 1 
1 — m(r, s) 

__ f(h {2 , 1 - r)f(h 1{2 ,s)d 2 - f(h, 1 - s)f(h, r)d 2 
f(hi\2,s) - f(h,r) 

Now, if we want to prove that with the dynamic 
scheduling we can perform better than the static schedul- 
ing, then we must prove that there exists at least one pair 
of values (r, s), for which the following holds 

E 1 (r,s)<f(h,t)d 1 (22) 
E 2 (r,s) <f(h 1]2 ,l-t)d 2 (23) 

Substituting the expressions of E\{r,s) and E 2 {r,s) 
from (|2~TI ) in (1221) . and using the properties of the 
energy consumption function, we prove that the dynamic 
scheduling performs better than static scheduling for all 
(r, s) such that 

r < t (24) 
ht — (h — hi\ 2 ) 
1 < s 

%|2 

■ 

This result implies that two schedule can cooperate to 
give longer network lifetime compared to optimum static 
schedule. Following subsection, generalizes this result. 

C. Geometrical Interpretation 

Let us consider the scenario where we have N sensor 
nodes to poll and this polling can be done in AH ways. 
Let us consider an N dimensional Euclidean space, 
where an axis corresponds to the energy consumption 
of a node. Given that the energy consumption can only 
assume positive real values, we are only concerned with 
the first orthant of this A-dimensional space. For any 
given schedule, as the transmission time allocation to 
the different nodes changes, the corresponding energy 



consumption of the nodes changes. So, the point defin- 
ing the energy consumption of the nodes in this N 
dimensional space describes an iV-dimensional convex 
hypersurface. For N\ possible schedules, we have iV! 
such hypersurfaces. The computational complexity of 
the problem of finding the optimum static and dynamic 
schedules depends on the properties of the intersections 
of these hypersurfaces. Also, note that the general shape 
of these surfaces is determined by the energy consump- 
tion function and the model of the spatial correlation in 
the sensor data. 

The optimal static schedule is the one whose energy 
hypersurface is intersected by the "equal energy line" 
closest to the origin. Further, the dynamic scheduling 
helps us achieve all the points on the convex hull C of N\ 
convex hypersurfaces, as those are all the points achiev- 
able with the cooperation of any number of schedules. It 
is obvious that the network lifetime cannot be increased 
by the cooperation of those schedules and their transmis- 
sion time allocations, which give any point in the interior 
of this convex-hull, as there is always a point on the 
surface of the convex-hull that is closer to the origin and 
gives better network lifetime. So, when two schedules 
cooperate, then the optimum transmission time allocation 
is the one that gives the line connecting the two points 
on the surface of the convex hull. Then the optimum 
network lifetime is achieved by some point on that line, 
specifically by the point on this line that is closest to the 
origin. Similarly, when three schedules cooperate, then 
the respective optimum transmission times allocation for 
the those three schedules is the one that gives the plane 
connecting the energy consumption points corresponding 
to the three schedules, on the surface of the convex hull 
and then the optimum network lifetime is achieved by 
some point on this plane. 

Formally, if m, ■ ■ ■ , 7r m , 7Tj £ II are the m cooperating 
schedules, then the plane defined by the m points on 
the m corresponding hypersurfaces, that is closest to the 
origin, must belong to the convex hull C. For these m 
schedules, the optimal lifetime of the network is obtained 
by that point on this plane that is closest to the origin. 
This is point is obtained as the solution to the following 
optimization problem: 

N m 

argmin JZiY] rjfiK^i), ^(i)K) 2 (25) 

r 1 ,...,r m . . , 

i=l ]=1 

s. t. < rj < 1,V1 < j < m 

m 



With these optimum values of the parameters {rj}™^, 
we solve for the network lifetime as follows: 

Ari...*„ = r min 77TT 1 Y^~ (26 ) 

ie{i,...,N} 2^ j=x rjfiK.^^t^jdi 
s. t. < rj < 1,V1 < j < m 

m 
3=1 

The optimum network lifetime L m for the set of m 
schedules, is obtained by solving above set of equations 
for all C(N\,m) possible combinations of m schedules, 
that is L m is obtained by: 

L m = max L ni ...n m (27) 

[7Ti,...,7r m ] 

7ri,...,7r m GiI 

Further, to obtain the optimum value of network 
lifetime over all possible combinations of the schedules 
is obtained by: 

L = max L m (28) 

l<m<N\ 

It should be noted that the equations ([25l)-(l28]). essen- 
tially solve ( fT5l ). However, this alternative formulation 
of the problem in ( fTBl ), helps us to prove following two 
important theorems: 

Theorem 5: The optimum network lifetime for m 
schedule cooperation is no worse than the optimum 
network lifetime for m — 1 schedule cooperation. That 
is L m > L m _i. 

Proof: Omitted for brevity. ■ 

Theorem 6: When m* = m*(N) < m < N\, then 
the cooperation among m or more schedules does not 
improve the network lifetime anymore. 

Proof: Omitted for brevity. ■ 

IV. Small Rate Region Approximation 

In this section, we assume that transmission rate is 
linearly proportional to signal power. This assumption is 
motivated by Shannon's AWGN capacity formula which 
is approximately linear for low data rates. The energy 
expended by a node to transmit H units of information 
is given by H x d, where d is the suitably normalized 
path loss factor between the node and the base station. 
The linear rate assumption implies, as shown below, 
that transmit energy is independent of transmission time. 
Hence, the optimal time allocation problem is trivial and 
we only need to find the optimal scheduling order. 

For the small data rates, the energy consumption 
function for the k th node under schedule 7r reduces to 
f(K(k),t-K(k))dk = K(k)dk- For example, by inverting 



Shannon's AWGN channel capacity formula, we get the 
following as the energy consumption function [6]: 

For the small data rates, this gives for some constant 

c > 

^Tr(fc) 

f{K(k)-,t-K(k))dk ~ *7r(jfc)(l + c- l)dk 

J Tr(fc) 

= ch^fydk 

A. Static Scheduling 

Under the "small rate region approximation", the static 
scheduling problem in (f2]) reduces to: 

max min — — (29) 

Tren i<fc<Af h nik) d n ( k) 

The objective function represents the lifetime of node 
7r(k) under the given static schedule it. In [15], we 
describe a greedy static scheduling strategy, Minimum 
Cost Next (MCN), and prove its optimality. 

The MCN schedule not only maximizes the minimum 
lifetime, but also maximizes all lifetimes from 2 nd mini- 
mum lifetime to iV th minimum lifetime. This is desirable 
in the situations, where the network has to continue to 
operate even when one or more nodes die out. Also, note 
that the MCN solution is Pareto-optimal. Given an MCN 
schedule, no other schedule can help increase any node's 
lifetime without decreasing some other node's lifetime. 

B. Dynamic Scheduling 

In this section, we explore how network lifetime can 
be increased by employing multiple schedules under the 
small data rate approximation. Under this assumption, 
as the general static scheduling problem in © reduced 
to (|29l , the general dynamic scheduling problem in (fT6l ) 
reduces to the following optimization problem that gives 
the optimal lifetime L 

L = max r 7T (30) 
Tren 

s.t. £ K {k) d k T w <E k , VI < k < N 
Tren 

TV is the number of slots for which the schedule it is 
used. Once more, the constraints ensure that the time 
assignment is feasible for each node with respect to its 
energy capability. Also as in IIII-BI (f30l > can be treated 
as a linear program. A dynamic schedule, r, is given by 
the set {t w }. 

Given N\ variables, in general, there seems to be no 
easy way to solve (l30l to compute the optimal 7v values. 



However, in [15], we exploit the special nature of our 
problem to propose an algorithm, which we refer to as 
Lifetime Optimal Clustering ALgorifhm (LOCAL), and 
prove its optimality. 

C. Geometric Interpretation 

In this section we discuss both the static and dynamic 
scheduling in "small rate region approximation" in the 
spirit of discussion of the nature of the solutions in 
IIII-CI Under this approximation, the energy consumption 
is independent of the transmission time, so the flex- 
ibility to change the energy consumption by varying 
the transmission time is not available. So, in the N- 
dimensional space, where each axis corresponds to the 
energy consumption of a node, for every schedule, we 
get a point in this space, rather than a hypersurface. For 
Nl possible polling schedules, we get ./V! points. 

The optimal static schedule under this approxima- 
tion is given the MCN algorithm. Given the nature of 
the problem in (|29l , this corresponds to that schedule 
which gives a point closest to the "equal energy line". 
This is not difficult to see if one notes that optimal 
static schedule attempts to 'equalize' the lifetimes/energy 
consumption of the nodes. However, as noted above, 
the flexibility of varying the transmission times of the 
nodes to 'equalize' their lifetimes is not available under 
this approximation, so the optimum static schedule does 
the best in providing a point closest to this "equal 
energy line", if not at the line itself. Note that the point 
corresponding to the optimum static schedule may not 
be the closest to the origin. 

Similarly, the optimal dynamic scheduling under 
"small rate region approximation" corresponds to finding 
those schedules, the lines, planes, or the hyperplanes 
connecting which contain the point closest to the "equal 
energy line". When more than one such points are 
possible, then one that is closest to the origin is taken to 
be the point of operation of the network. 

V. Conclusions and Future Work 

In this paper, we have considered the problem of 
maximizing the lifetime of a data gathering wireless 
network. Our contribution differs from previous research 
in two respects. Firstly, we proposed a combined source- 
channel coding framework to mitigate the energy cost of 
radio transmission. Secondly, we have explicitly maxi- 
mized network lifetime as opposed to other objective 
functions such as cumulative energy cost. To the best 
of our knowledge, both these aspects have not been 
explored in the context of sensor networks previously. 



In our system model, nodes communicate directly to 
a base station in a time division multiplexed manner. 
With our notion of instantaneous decoding, we show 
that the network lifetime maximization problem reduces 
to finding an optimal scheduling strategy (polling order 
and transmission time allocation). 

We considered the general channel problem and 
proved that there the optimal static and dynamic schedul- 
ing problems are jVP-hard and the optimal dynamic 
scheduling strategy indeed does better than the optimal 
static scheduling strategy. Then we considered the sce- 
nario where the energy consumption is independent of 
transmission time. For both, the general channel problem 
and its approximated version, we provided the geometric 
interpretation of the optimal solutions and the solution 
search processes. 

This paper assumed that source and channel coding 
is optimal, quantization is perfect and that a continuum 
of power levels can be employed. Network lifetime 
obtained under these assumptions is an upper limit to 
practically achievable performance. It would be useful 
to consider the network lifetime problem with more 
realistic constraints. Finally, the system model has to be 
generalized to the multi-hop case. 
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